pancreatic cancer
Intercept Cancer: Cancer Pre-Screening with Large Scale Healthcare Foundation Models
Sun, Liwen, Yao, Hao-Ren, Gao, Gary, Frieder, Ophir, Xiong, Chenyan
Cancer screening, leading to early detection, saves lives. Unfortunately, existing screening techniques require expensive and intrusive medical procedures, not globally available, resulting in too many lost would-be-saved lives. We present CATCH-FM, CATch Cancer early with Healthcare Foundation Models, a cancer pre-screening methodology that identifies high-risk patients for further screening solely based on their historical medical records. With millions of electronic healthcare records (EHR), we establish the scaling law of EHR foundation models pretrained on medical code sequences, pretrain compute-optimal foundation models of up to 2.4 billion parameters, and finetune them on clinician-curated cancer risk prediction cohorts. In our retrospective evaluation comprising of thirty thousand patients, CATCH-FM achieves strong efficacy, with 50% sensitivity in predicting first cancer risks at 99% specificity cutoff, and outperforming feature-based tree models and both general and medical LLMs by up to 20% AUPRC. Despite significant demographic, healthcare system, and EHR coding differences, CATCH-FM achieves state-of-the-art pancreatic cancer risk prediction on the EHRSHOT few-shot leaderboard, outperforming EHR foundation models pretrained using on-site patient data. Our analysis demonstrates the robustness of CATCH-FM in various patient distributions, the benefits of operating in the ICD code space, and its ability to capture non-trivial cancer risk factors. Our code will be open-sourced.
- Asia > Taiwan (0.05)
- Asia > Middle East > Jordan (0.04)
- North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
- (5 more...)
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
- (2 more...)
Early Detection of Pancreatic Cancer Using Multimodal Learning on Electronic Health Records
Aouad, Mosbah, Choudhary, Anirudh, Farooq, Awais, Nevers, Steven, Demirkhanyan, Lusine, Harris, Bhrandon, Pappu, Suguna, Gondi, Christopher, Iyer, Ravishankar
Pancreatic ductal adenocarcinoma (PDAC) is one of the deadliest c ancers, and early detection remains a major clinical challenge due to the absence of spec ific symptoms and reliable biomarkers. In this work, we propose a new multimodal appro ach that integrates longitudinal diagnosis code histories and routinely collected laborato ry measurements from electronic health records to detect PDAC up to one year prior to clin ical diagnosis. Our method combines neural controlled differential equations to model irregular lab time series, pretrained language models and recurrent networks to learn diagnosis code trajectory representations, and cross-attention mechanisms to capture in teractions between the two modalities. We develop and evaluate our approach on a real-world dat aset of nearly 4,700 patients and achieve significant improvements in AUC ranging from 6.5 % to 15.5% over state-of-the-art methods. Furthermore, our model identifies diagnosis codes and laboratory panels associated with elevated PDAC risk, including both established and new biomarkers.
- North America > United States > Illinois > Cook County > Chicago (0.05)
- North America > United States > Illinois > Champaign County > Urbana (0.04)
- Research Report > Experimental Study (1.00)
- Research Report > New Finding (0.93)
- Health & Medicine > Therapeutic Area > Gastroenterology (1.00)
- Health & Medicine > Health Care Technology > Medical Record (1.00)
- Health & Medicine > Therapeutic Area > Endocrinology > Diabetes (0.93)
- Health & Medicine > Therapeutic Area > Oncology > Pancreatic Cancer (0.72)
Opportunistic Screening for Pancreatic Cancer using Computed Tomography Imaging and Radiology Reports
Le, David, Correa-Medero, Ramon, Tariq, Amara, Patel, Bhavik, Yano, Motoyo, Banerjee, Imon
Pancreatic ductal adenocarcinoma (PDAC) is a highly aggressive cancer, with most cases diagnosed at stage IV and a five-year overall survival rate below 5%. Early detection and prognosis modeling are crucial for improving patient outcomes and guiding early intervention strategies. In this study, we developed and evaluated a deep learning fusion model that integrates radiology reports and CT imaging to predict PDAC risk. The model achieved a concordance index (C-index) of 0.6750 (95% CI: 0.6429, 0.7121) and 0.6435 (95% CI: 0.6055, 0.6789) on the internal and external dataset, respectively, for 5-year survival risk estimation. Kaplan-Meier analysis demonstrated significant separation (p<0.0001) between the low and high risk groups predicted by the fusion model. These findings highlight the potential of deep learning-based survival models in leveraging clinical and imaging data for pancreatic cancer.
- North America > United States > Arizona > Maricopa County > Phoenix (0.14)
- North America > United States > Florida > Duval County > Jacksonville (0.05)
- North America > United States > Minnesota > Olmsted County > Rochester (0.04)
- (2 more...)
- Research Report > New Finding (1.00)
- Research Report > Experimental Study (1.00)
- Health & Medicine > Therapeutic Area > Oncology > Pancreatic Cancer (1.00)
- Health & Medicine > Therapeutic Area > Oncology > Carcinoma (0.70)
Enhancing Pancreatic Cancer Staging with Large Language Models: The Role of Retrieval-Augmented Generation
Johno, Hisashi, Johno, Yuki, Amakawa, Akitomo, Sato, Junichi, Tozuka, Ryota, Komaba, Atsushi, Watanabe, Hiroaki, Watanabe, Hiroki, Goto, Chihiro, Morisaka, Hiroyuki, Onishi, Hiroshi, Nakamoto, Kazunori
Purpose: Retrieval-augmented generation (RAG) is a technology to enhance the functionality and reliability of large language models (LLMs) by retrieving relevant information from reliable external knowledge (REK). RAG has gained interest in radiology, and we previously reported the utility of NotebookLM, an LLM with RAG (RAG-LLM), for lung cancer staging. However, since the comparator LLM differed from NotebookLM's internal model, it remained unclear whether its advantage stemmed from RAG or inherent model differences. To better isolate RAG's impact and assess its utility across different cancers, we compared NotebookLM with its internal LLM, Gemini 2.0 Flash, in a pancreatic cancer staging experiment. Materials and Methods: A summary of Japan's pancreatic cancer staging guidelines was used as REK. We compared three groups - REK+/RAG+ (NotebookLM with REK), REK+/RAG- (Gemini 2.0 Flash with REK), and REK-/RAG- (Gemini 2.0 Flash without REK) - in staging 100 fictional pancreatic cancer cases based on CT findings. Staging criteria included TNM classification, local invasion factors, and resectability classification. In REK+/RAG+, retrieval accuracy was quantified based on the sufficiency of retrieved REK excerpts. Results: REK+/RAG+ achieved a staging accuracy of 70%, outperforming REK+/RAG- (38%) and REK-/RAG- (35%). For TNM classification, REK+/RAG+ attained 80% accuracy, exceeding REK+/RAG- (55%) and REK-/RAG- (50%). Additionally, REK+/RAG+ explicitly presented retrieved REK excerpts, achieving a retrieval accuracy of 92%. Conclusion: NotebookLM, a RAG-LLM, outperformed its internal LLM, Gemini 2.0 Flash, in a pancreatic cancer staging experiment, suggesting that RAG may improve LLM's staging accuracy. Furthermore, its ability to retrieve and present REK excerpts provides transparency for physicians, highlighting its applicability for clinical diagnosis and classification.
- North America > United States > Texas > Kleberg County (0.04)
- North America > United States > Texas > Chambers County (0.04)
- Asia > Japan > Honshū > Tōhoku > Miyagi Prefecture > Sendai (0.04)
- (2 more...)
- Health & Medicine > Therapeutic Area > Oncology > Pancreatic Cancer (1.00)
- Health & Medicine > Therapeutic Area > Gastroenterology (1.00)
- Health & Medicine > Therapeutic Area > Endocrinology (1.00)
In Silico Pharmacokinetic and Molecular Docking Studies of Natural Plants against Essential Protein KRAS for Treatment of Pancreatic Cancer
Kappan, Marsha Mariya, George, Joby
A kind of pancreatic cancer called Pancreatic Ductal Adenocarcinoma (PDAC) is anticipated to be one of the main causes of mortality during past years. Evidence from several researches supported the concept that the oncogenic KRAS (Ki-ras2 Kirsten rat sarcoma viral oncogene) mutation is the major cause of pancreatic cancer. KRAS acts as an on-off switch that promotes cell growth. But when the KRAS gene is mutated, it will be in one position, allowing the cell growth uncontrollably. This uncontrollable multiplication of cells causes cancer growth. Therefore, KRAS was selected as the target protein in the study. Fifty plant-derived compounds are selected for the study. To determine whether the examined drugs could bind to the KRAS complex's binding pocket, molecular docking was performed. Computational analyses were used to assess the possible ability of tested substances to pass the Blood Brain Barrier (BBB). To predict the bioactivity of ligands a machine learning model was created. Five machine learning models were created and have chosen the best one among them for analyzing the bioactivity of each ligand. From the fifty plant-derived compounds the compounds with the least binding energies are selected. Then bioactivity of these six compounds is analyzed using Random Forest Regression model. Adsorption, Distribution, Metabolism, Excretion (ADME) properties of compounds are analyzed. The results showed that borneol has powerful effects and acts as a promising agent for the treatment of pancreatic cancer. This suggests that borneol found in plants like mint, ginger, rosemary, etc., is a successful compound for the treatment of pancreatic cancer.
- North America > United States (0.93)
- Asia > India > Kerala (0.04)
- Research Report > New Finding (1.00)
- Research Report > Experimental Study (0.93)
- Health & Medicine > Therapeutic Area > Oncology > Pancreatic Cancer (1.00)
- Health & Medicine > Therapeutic Area > Gastroenterology (1.00)
- Government > Regional Government > North America Government > United States Government > FDA (0.68)
SwiftDossier: Tailored Automatic Dossier for Drug Discovery with LLMs and Agents
Fossi, Gabriele, Boulaimen, Youssef, Outemzabet, Leila, Jeanray, Nathalie, Gerart, Stephane, Vachenc, Sebastien, Giemza, Joanna, Raieli, Salvatore
The advancement of artificial intelligence algorithms has expanded their application to several fields such as the biomedical domain. Artificial intelligence systems, including Large Language Models (LLMs), can be particularly advantageous in drug discovery, which is a very long and expensive process. However, LLMs by themselves lack in-depth knowledge about specific domains and can generate factually incorrect information. Moreover, they are not able to perform more complex actions that imply the usage of external tools. Our work is focused on these two issues. Firstly, we show how the implementation of an advanced RAG system can help the LLM to generate more accurate answers to drug-discovery-related questions. The results show that the answers generated by the LLM with the RAG system surpass in quality the answers produced by the model without RAG. Secondly, we show how to create an automatic target dossier using LLMs and incorporating them with external tools that they can use to execute more intricate tasks to gather data such as accessing databases and executing code. The result is a production-ready target dossier containing the acquired information summarized into a PDF and a PowerPoint presentation.
- North America > United States (0.04)
- Europe > France (0.04)
- Health & Medicine > Therapeutic Area > Oncology (1.00)
- Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
Mother, 62, is diagnosed with world's deadliest cancer years in advance thanks to artificial-intelligence-powered blood test: 'AI saved my life, I won the lottery'
Like millions of people in the US, artificial intelligence was just something Dianne Balon read about on the news. Little did she know the tech would come to save her life. Despite being a picture of health, an AI-powered blood test in 2022 revealed that one of the world's deadliest cancers was silently forming in Ms Balon's pancreas. It caught the tumor in its earliest form, before it had the chance to grow and spread, which is when the vast majority of pancreatic cancers are caught - at which point it's too late. The results of the test provided a key'piece of the puzzle'.
- North America > United States (0.26)
- North America > Canada > Alberta > Census Division No. 11 > Edmonton Metropolitan Region > Edmonton (0.05)
Evaluating Explanatory Capabilities of Machine Learning Models in Medical Diagnostics: A Human-in-the-Loop Approach
Bobes-Bascarán, José, Mosqueira-Rey, Eduardo, Fernández-Leal, Ángel, Hernández-Pereira, Elena, Alonso-Ríos, David, Moret-Bonillo, Vicente, Figueirido-Arnoso, Israel, Vidal-Ínsua, Yolanda
Explainable AI (XAI) [1] is a research field focused on making Artificial Intelligence (AI) systems in general, and Machine Learning (ML) systems in particular, more understandable to humans. Explainable AI offers several advantages, to name a few: it fosters confidence in the prediction of the model by making the decision-making process more transparent, promotes responsible AI development, aids in debugging and identifying issues, and allows auditing of AI models and checking if they adhere to regulatory standards. The inherent explainability of AI systems has not remained static but has changed considerably as a result of technological progress. In fact, explainability has become an increasingly difficult issue to tackle, as the internal functioning of AI systems has become less intelligible as they have become more complex [2]. Initially, symbolic AI models were explainable per se, e.g., rule-based expert systems could easily show to their users which rules they had followed to make a given decision, even though the rules can incorporate measures of uncertainty and imprecision as, for example, in fuzzy systems. These type of AI models are considered transparent, which means that the model itself is understandable [3], being understandability the characteristic of a model to make a human understand its function without any need for explaining its internal structure or the algorithmic means by which the model processes data internally [4].
- North America > United States > New York > New York County > New York City (0.04)
- Asia > Middle East > Israel (0.04)
- South America > Chile > Santiago Metropolitan Region > Santiago Province > Santiago (0.04)
- (4 more...)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Expert Systems (1.00)
- Information Technology > Artificial Intelligence > Natural Language > Explanation & Argumentation (1.00)
- Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
- (3 more...)
Toward a Team of AI-made Scientists for Scientific Discovery from Gene Expression Data
Liu, Haoyang, Li, Yijiang, Jian, Jinglin, Cheng, Yuxuan, Lu, Jianrong, Guo, Shuyi, Zhu, Jinglei, Zhang, Mianchen, Zhang, Miantong, Wang, Haohan
Machine learning has emerged as a powerful tool for scientific discovery, enabling researchers to extract meaningful insights from complex datasets. For instance, it has facilitated the identification of disease-predictive genes from gene expression data, significantly advancing healthcare. However, the traditional process for analyzing such datasets demands substantial human effort and expertise for the data selection, processing, and analysis. To address this challenge, we introduce a novel framework, a Team of AI-made Scientists (TAIS), designed to streamline the scientific discovery pipeline. TAIS comprises simulated roles, including a project manager, data engineer, and domain expert, each represented by a Large Language Model (LLM). These roles collaborate to replicate the tasks typically performed by data scientists, with a specific focus on identifying disease-predictive genes. Furthermore, we have curated a benchmark dataset to assess TAIS's effectiveness in gene identification, demonstrating our system's potential to significantly enhance the efficiency and scope of scientific exploration. Our findings represent a solid step towards automating scientific discovery through large language models.
- North America > United States > Illinois (0.04)
- Europe > Poland > Greater Poland Province > Poznań (0.04)
- Europe > Netherlands (0.04)
- Research Report > New Finding (1.00)
- Research Report > Experimental Study (1.00)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Scientific Discovery (1.00)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
- Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
- (2 more...)
A new AI-based risk prediction system could help catch deadly pancreatic cancer cases earlier
As a result, it's essential to try to catch pancreatic cancer at the earliest stage possible. A team of researchers from MIT's Computer Science and Artificial Intelligence Laboratory (CSAIL) worked with Limor Appelbaum, a staff scientist in the department of radiation oncology at the Beth Israel Deaconess Medical Center in Boston, to develop an AI system that predicts a patient's likelihood of developing pancreatic ductal adenocarcinoma (PDAC), the most common form of the cancer. The system outperformed current diagnostic standards and could someday be used in a clinical setting to identify patients who could benefit from early screening or testing, helping catch the disease earlier and save lives. The research was published in the journal eBioMedicine last month. The researchers' goal was a model capable of predicting a patient's risk of being diagnosed with PDAC in the next six to 18 months, making early-stage detection and cure more likely.
- Asia > Middle East > Israel (0.26)
- North America > United States (0.06)